Robust distant speaker recognition based on position dependent cepstral mean normalization

نویسندگان

  • Longbiao Wang
  • Norihide Kitaoka
  • Seiichi Nakagawa
چکیده

In a distant environment, channel distortion may drastically degrade speaker recognition performance. In this paper, we propose a robust speaker recognition method based on position dependent Cepstral Mean Normalization (CMN) to compensate the channel distortion depending on the speaker position. It is shown in [1] that the position dependent CMN is robust for speech recognition in a distant environment. We extend this method to the speaker recognition and show that this method is much effective to speaker recognition. In the training stage, the system measures the transmission characteristics according to the speaker positions from some grid points to the microphone in the room and estimated the compensation parameters a priori. In the recognition stage, the system estimates the speaker position and adopts the estimated compensation parameters corresponding to the estimated position, and then the system applies the CMN to the speech and performs speaker recognition. In our past study, we proposed a new textindependent speaker recognition method by combining speakerspecific Gaussian Mixture Models (GMMs) with syllable-based HMMs adapted to the speakers by MAP [2]. The robustness of this speaker recognition method for the change of the speaking style in close-talking environment was evaluated in [2]. We integrated this method to the proposed position dependent CMN for distant speaker recognition. Our experiments showed that the proposed method improved the speaker recognition performance remarkably in a distant environment.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Robust distant speech recognition based on position dependent CMN

In a distant environment, channel distortion may dramatically degrade speech recognition performance. In this paper, we propose a robust speech recognition method based on position dependent Cepstral Mean Normalization (CMN). At first the system measures the transmission characteristics according to the speaker positions from some grid points in the room a priori. In the recognition stage, the ...

متن کامل

Robust distant speech recognition based on position dependent CMN using a novel multiple microphone processing technique

In a distant environment, channel distortion may drastically degrade speech recognition performances. In this paper, we propose a robust multiple microphone speech processing approach based on position dependent Cepstral Mean Normalization (CMN). In the training stage, the system measures the transmission characteristics according to the speaker positions from some grid points in the room and e...

متن کامل

Robust distant speaker recognition based on position-dependent CMN by combining speaker-specific GMM with speaker-adapted HMM

In this paper, we propose a robust speaker recognition method based on position-dependent Cepstral Mean Normalization (CMN) to compensate for the channel distortion depending on the speaker position. In the training stage, the system measures the transmission characteristics according to the speaker positions from some grid points to the microphone in the room and estimates the compensation par...

متن کامل

Robust Speech Recognition by Combining Short-Term and Long-Term Spectrum Based Position-Dependent CMN with Conventional CMN

In a distant-talking environment, the length of channel impulse response is longer than the short-term spectral analysis window. Conventional short-term spectrum based Cepstral Mean Normalization (CMN) is therefore, not effective under these conditions. In this paper, we propose a robust speech recognition method by combining a short-term spectrum based CMN with a long-term one. We assume that ...

متن کامل

Robust Speech Recognition in Distant Environment Based on Speaker Position and Speaking Direction Detection

In a practical environment, channel distortion may severly degrade speech recognition performance. In this paper, we propose a robust speech recognition method using real-time Cepstral Mean Normalization (CMN) [1] based on speaker position and speaking direction detection. We first estimate the speaker position in a 3-D space based on the time delay of arrival (TDOA) between distinct microphone...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005